f 



7. Sequence Listing 

SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Hook, Magnus 

Patti, Joseph M. 
House - Pompeo , Karen 
Sthanam, Narayana 
Symersky, Jindrich 

(ii) TITLE OF INVENTION: COLLAGEN BINDING PROTEIN COMPOSITIONS 
AND METHODS OF USE 

(iii) NUMBER OF SEQUENCES: 8 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Arnold, White & Durkee 

(B) STREET: P.O. Box 4433 

(C) CITY: Houston 

(D) STATE: Texas 

(E) COUNTRY: U.S. 

(F) ZIP: 77210 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US Unknown 

(B) FILING DATE: Concurrently Herewith 

(C) CLASSIFICATION: Unknown 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 60/017,678 

(B) FILING DATE: 16-MAY-1996 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Kitchell, Barbara S. 

(B) REGISTRATION NUMBER: 33,928 

(C) REFERENCE/DOCKET NUMBER: TAMK:193 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (512) 418-3000 

(B) TELEFAX: (512) 474-7577 



(2) INFORMATION FOR SEQ ID NO:l: 



o 
I)) 

111 



ill 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 441 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

ATAACATCTG GGAATAAATC AACGAATGTT ACGGTTCATA AAAGTGAAGC GGGAACAAGT 60 

AGTGTTTTCT ATTATAAAAC GGGAGATATG CTACCAGAAG ATACGACACA TGTACGATGG 12 0 

TTTTTAAATA TTAACAATGA AAAAAGTTAT GTATCGAAAG ATATTACTAT AAAGGATCAG 18 0 

ATTCAAGGTG GACAGCAGTT AGATTTAAGC ACATTAAACA TTAATGTGAC AGGTACACAT 24 0 

AGCAATTATT ATAGTGGACA AAGTGCAATT ACTGATTTTG AAAAAGCCTT TCCAGGTTCT 3 00 

AAAATAACTG TTGATAATAC GAAGAACACA ATTGATGTAA CAATTCCACA AGGCTATGGG 360 

Q 

5 II TCATATAATA GTTTTTCAAT TAACTACAAA ACCAAAATTA CGAATGAACA GCAAAAAGAG 42 0 

m 

lab TTTGTTAATA ATTCACAAGC T 441 

Ml 

m 

IIJ (2) INFORMATION FOR SEQ ID NO : 2 : 



!!! 
iil 
111 
III 

an. 
la* 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 159 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Met Arg Gly Ser His His His His His His Gly Ser lie Thr Ser Gly 
15 10 15 

Asn Lys Ser Thr Asn Val Thr Val His Lys Ser Glu Ala Gly Thr Ser 
20 25 30 

Ser Val Phe Tyr Tyr Lys Thr Gly Asp Met Leu Pro Glu Asp Thr Thr 
35 40 45 

His Val Arg Trp Phe Leu Asn lie Asn Asn Glu Lys Ser Tyr Val Ser 
50 55 60 

Lys Asp lie Thr lie Lys Asp Gin lie Gin Gly Gly Gin Gin Leu Asp 
65 70 75 80 

Leu Ser Thr Leu Asn lie Asn Val Thr Gly Thr His Ser Asn Tyr Tyr 
85 90 95 
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Ser Gly Gin Ser Ala lie Thr Asp Phe Glu Lys Ala Phe Pro Gly Ser 
100 105 110 

Lys lie Thr Val Asp Asn Thr Lys Asn Thr lie Asp Val Thr lie Pro 
115 120 125 

Gin Gly Tyr Gly Ser Tyr Asn Ser Phe Ser lie Asn Tyr Lys Thr Lys 
130 135 140 

lie Thr Asn Glu Gin Gin Lys Glu Phe Val Asn Asn Ser Gin Ala 
145 150 155 



(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

/ ? (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

[It GACGATAAAA ATGGAAAAAT ACAAAATGGT GACATGATTA AAGTGGCATG GCCGACAAGC 60 



ill 



GGTACAGTAA AGATAGAGGG TTATAGTAAA ACAGTACCAT TAACTGTTAA AGGTGAACAG 120 

H GTGGGTCAAG CAGTTATTAC AC CAGACGGT GCAACAATTA CATTCAATGA TAAAGTAGAA 180 

□ 

lij AAATTAAGTG ATGTTTCGGG ATTTGCAGAA TTTGAAGTAC AAGGAAGAAA TTTAACGCAA 240 

! I J 

|1| ACAAATACTT CAGATGACAA AGTAGCTACG ATAACATCTG GGAATAAATC AACGAATGTT 300 

O - 

j^L ACGGTTCATA AAAGTGAAGC GGG AACAAGT AGTGTTTTCT ATTATAAAAC GGGAGATATG 360 

CTACCAGAAG ATACGACACA TGTACGATGG TTTTTAAATA TTAACAATGA AAAAAGTTAT 420 

GTATCGAAAG ATATTACTAT AAAGG AT CAG ATT CAAGGTG GACAGCAGTT AGATTTAAGC 480 

ACATTAAACA TTAATGTGAC AGGTACACAT AGCAATTATT ATAGTGGACA AAGTGCAATT 540 

ACTGATTTTG AAAAAGCCTT TCCAGGTTCT AAAATAACTG TTGATAATAC GAAGAACACA 600 

ATTGATGTAA CAATTCCACA AGGCTATGGG TCATATAATA GTTTTTCAAT TAACTACAAA 660 

ACCAAAATTA CGAATGAACA GCAAAAAGAG TTTGTTAATA ATTCACAAGC TTGGTATCAA 720 

GAGCATGGTA AGGAAGAAGT GAACGGGAAA TCATTTAATC ATACTGTGCA CAATATTAAT 780 

GCTAATGCCG GTATTGAAGG TACTGTAAAA GGTGAATTAA AAGTTTTAAA ACAGGATAAA 840 

GATACCAAG 849 
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(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 211 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

Met Arg Gly Ser His His His His His His Gly jser Asp Asp Lys Val 
1 5 10 15 

Ala Thr lie Thr Ser Gly Asn Lys Ser Thr Asn Val Thr Val His Lys 
20 25 30 

Ser Glu Ala Gly Thr Ser Ser Val Phe Tyr Tyr Lys Thr Gly Asp Met 
35 40 45 

Leu Pro Glu Asp Thr Thr His Val Arg Trp Phe Leu Asn lie Asn Asn 
50 55 60 

Glu Lys Ser Tyr Val Ser Lys Asp lie Thr lie Lys Asp Gin lie Gin 
65 70 75 80 

Gly Gly Gin Gin Leu Asp Leu Ser Thr Leu Asn lie Asn Val Thr Gly 
85 90 95 



\i$ Thr His Ser Asn Tyr Tyr Ser Gly Gin Ser Ala lie Thr Asp Phe Glu 

IIS 100 105 110 

ill 

s** Lys Ala Phe Pro Gly Ser Lys lie Thr Val Asp Asn Thr Lys Asn Thr 

7- 115 120 125 

lie Asp Val Thr He Pro Gin "Gly Tyr Gly Ser Tyr Asn Ser Phe Ser 
130 135 140 

'* 

He Asn Tyr Lys Thr Lys He Thr Asn Glu Gin Gin Lys Glu Phe Val 
145 150 155 160 

Asn Asn Ser Gin Ala Trp Tyr Gin Glu His Gly Lys Glu Glu Val Asn 
165 170 175 

Gly Lys Ser Phe Asn His Thr Val His Asn He Asn Ala Asn Ala Gly 
180 L85 190 

He Glu Gly Thr Val Lys Gly Glu Leu Lys Val Leu Lys Gin Asp Lys 
195 200 205 

Asp Thr Lys 
210 
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(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1500 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

GCACGAGATA TTTCATCAAC GAATGTTACA GATTTAACTG TATCACCGTC TAAGATAGAA 60 

GATGGTGGTA AAACGACAGT AAAAATGACG TTCGACGATA AAAATGGAAA AATACAAAAT 120 

GGTGACATGA TTAAAGTGGC ATGGCCGACA AGCGGTACAG TAAAGATAGA GGGTTATAGT 180 

AAAACAGTAC CATTAACTGT TAAAGGTGAA CAGGTGGGTC AAGCAGTTAT TACACCAGAC 240 

GGTGCAACAA TTACATTCAA TGATAAAGTA GAAAAATTAA GTGATGTTTC GGGATTTGCA 300 

GAATTTGAAG TACAAGGAAG AAATTTAACG CAAACAAATA CTTCAGATGA CAAAGTAGCT 3 60 

ACGATAACAT CTGGGAATAA ATCAACGAAT GTTACGGTTC ATAAAAGTGA AGCGGGAACA 420 

AGTAGTGTTT TCTATTATAA AACGGGAGAT ATGCTACCAG AAGATACGAC ACATGTACGA 4 80 

TGGTTTTTAA ATATTAACAA TGAAAAAAGT TATGTATCGA AAGATATTAC TATAAAGGAT 540 

CAGATTCAAG GTGGACAGCA GTTAGATTTA AGCACATTAA ACATTAATGT GACAGGTACA 600 

CATAGCAATT ATTATAGTGG ACAAAGTGCA ATTACTGATT TTGAAAAAGC CTTTCCAGGT 660 

TCTAAAATAA CTGTTGATAA TACGAAGAAC ACAATTGATG TAACAATTCC ACAAGGCTAT 720 

GGGTCATATA ATAGTTTTTC AATTAACTAC AAAACCAAAA TTACGAATGA ACAGCAAAAA 780 

GAGTTTGTTA ATAATTCACA AGCTTGGTAT CAAGAGCATG GTAAGGAAGA AGTGAACGGG 840 

AAATCATTTA ATCATACTGT GCACAATATT AATGCTAATG CCGGTATTGA AGGTACTGTA 900 

AAAGGTGAAT TAAAAGTTTT AAAACAGGAT AAAGAT AC CA AGGCTCCTAT AGCTAATGTA 960 

AAATTTAAAC TTTCTAAAAA AGATGGATCA GTTGTAAAGG ACAATCAAAA AGAAATTGAG 1020 

ATTATAACAG ATGCAAACGG TATTGCTAAT ATTAAAGCGT TGCCTAGTGG AGACTATATT 1080 

TTAAAAGAAA TAGAGGCGCC ACGACCGTAT ACATTTGATA AGGATAAAGA ATATCCGTTT 1140 

ACTATGAAAG ATACAGATAA TCAGGGATAT TTTACGACTA TTGAAAATGC AAAAGCGATA 1200 

GAAAAAACAA AAGATGTTTC TGCTCAAAAG GTTTGGGAAG GCACTCAAAA AGTGAAACCA 1260 
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ACGATTTATT TCAAGTTGTA CAAACAAGAT GACAATCAAA ATACAACACC AGTAGACAAA 132 0 

GC AG AG ATT A AAAAATTAGA AGATGGAACG ACAAAAGTGA CATGGTCTAA TCTTCCGGAA 1380 

AATGACAAAA ATGGCAAGGC TATTAAATAT TTAGTTAAAG AAGTAAATGC TCAAGGTGAA 144 0 

GATACAACAC CAGAAGGATA TACTAAAAAA GAAAATGGTT TAGTGGTTAC TAATACTGAA 1500 



(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 512 amino acids 

(B) TYPE: amino acid 

(C) STRAND ED NESS : 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Arg Gly Ser His His His His His His Gly Ser Ala Arg Asp He 
15 10 15 

Ser Ser Thr Asn Val Thr Asp Leu Thr Val Ser Pro Ser Lys He Glu 
20 25 30 

Asp Gly Gly Lys Thr Thr Val Lys Met Thr Phe Asp Asp Lys Asn Gly 
35 40 45 

Lys He Gin Asn Gly Asp Met He Lys Val Ala Trp Pro Thr Ser Gly 
50 55 60 

Thr Val Lys He Glu Gly Tyr Ser Lys Thr Val Pro Leu Thr Val Lys 
65 70 75 80 

Gly Glu Gin Val Gly Gin Ala Val He Thr Pro Asp Gly Ala Thr He 
85 90 95 

Thr Phe Asn Asp Lys Val Glu Lys Leu Ser Asp Val Ser Gly Phe Ala 
100 105 110 

Glu Phe Glu Val Gin Gly Arg Asn Leu Thr Gin Thr Asn Thr Ser Asp 
115 120 125 

Asp Lys Val Ala Thr He Thr Ser Gly Asn Lys Ser Thr Asn Val Thr 
130 135 140 

Val His Lys Ser Glu Ala Gly Thr Ser Ser Val Phe Tyr Tyr Lys Thr 
145 150 155 160 

Gly Asp Met Leu Pro Glu Asp Thr Thr His Val Arg Trp Phe Leu Asn 
165 170 175 

He Asn Asn Glu Lys Ser Tyr Val Ser Lys Asp He Thr He Lys Asp 
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180 



185 



190 



Gin lie Gin Gly Gly Gin Gin Leu Asp Leu Ser Thr Leu Asn lie Asn 
195 200 205 

Val Thr Gly Thr His Ser Asn Tyr Tyr Ser Gly Gin Ser Ala lie Thr 
210 215 220 

Asp Phe Glu Lys Ala Phe Pro Gly Ser Lys lie Thr Val Asp Asn Thr 
225 230 235 240 

Lys Asn Thr lie Asp Val Thr lie Pro Gin Gly Tyr Gly Ser Tyr Asn 
245 250 255 

Ser Phe Ser lie Asn Tyr Lys Thr Lys lie Thr Asn Glu Gin Gin Lys 
260 265 270 

Glu Phe Val Asn Asn Ser Gin Ala Trp Tyr Gin Glu His Gly Lys Glu 
275 280 285 

Glu Val Asn Gly Lys Ser Phe Asn His Thr Val His Asn lie Asn Ala 
290 295 300 

Asn Ala Gly lie Glu Gly Thr Val Lys Gly Glu Leu Lys Val Leu Lys 
305 310 315 320 

Gin Asp Lys Asp Thr Lys Ala Pro lie Ala Asn Val Lys Phe Lys Leu 
325 330 335 

Ser Lys Lys Asp Gly Ser Val Val Lys Asp Asn Gin Lys Glu lie Glu 
340 345 350 

lie lie Thr Asp Ala Asn Gly lie Ala Asn lie Lys Ala Leu -Pro Ser 
355 360 365 

Gly Asp Tyr lie Leu Lys Glu lie Glu Ala Pro Arg Pro Tyr Thr Phe 
370 375 380 

Asp Lys Asp Lys Glu Tyr Pro Phe Thr Met Lys Asp Thr Asp Asn Gin 
385 390 395 400 

Gly Tyr Phe Thr Thr lie Glu Asn Ala Lys Ala lie Glu Lys Thr Lys 
405 410 415 

Asp Val Ser Ala Gin Lys Val Trp Glu Gly Thr Gin Lys Val Lys Pro 
420 425 430 

Thr lie Tyr Phe Lys Leu Tyr Lys Gin Asp Asp Asn Gin Asn Thr Thr 
435 440 445 

Pro Val Asp Lys Ala Glu lie Lys Lys Leu Glu Asp Gly Thr Thr Lys 
450 455 460 
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Val Thr Trp Ser Asn Leu Pro Glu Asn Asp Lys Asn Gly Lys Ala lie 
465 470 475 480 

Lys Tyr Leu Val Lys Glu Val Asn Ala Gin Gly Glu Asp Thr Thr Pro 

485 490 495 

Glu Gly Tyr Thr Lys Lys Glu Asn Gly Leu Val Val Thr Asn Thr Glu 

500 505 510 



(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

S II Met Arg Gly Ser His His His His His His Gly Ser Met Val Ala Ala 

ill 15 10 15 

i ' ■ " 

jjj Asp Ala Pro Ala Ala Gly Thr Asp lie Thr Asn Gin Leu Thr Asn Val 

ij5 20 25 30 

lis 

j = | Thr Val Gly lie Asp Ser Gly Thr Thr Val Tyr Pro His Gin Ala Gly 

** 35 40 45 

Tyr Val Lys Leu Asn Tyr Gly Phe Ser Val Pro Asn Ser Ala Val Lys 
50 55 60 

Gly Asp Thr Phe Lys lie Thr Val Pro Lys Glu Leu Asn Leu Asn Gly 
65 70 75 80 

Val Thr Ser Thr Ala Lys Val Pro Pro lie Met Ala Gly Asp Gin Val 
85 90 95 

Leu Ala Asn Gly Val lie Asp Ser Asp Gly Asn Val lie Tyr Thr Phe 
100 105 110 

Thr Asp Tyr Val Asn Thr Lys Asp Asp Val Lys Ala Thr Leu Thr Met 
115 120 125 

Pro Ala Tyr lie Asp Pro Glu Asn Val Lys Lys Thr Gly Asn Val Thr 
130 135 140 

Leu Ala Thr Gly lie Gly Ser Thr Thr Ala Asn Lys Thr Val Leu Val 
145 150 155 160 

Asp Tyr Glu Lys Tyr Gly Lys Phe Tyr Asn Leu Ser lie Lys Gly Thr 
165 170 175 



Ui 

ill 
ill 
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lie Asp Gin lie Asp Lys Thr Asn Asn Thr Tyr Arg Gin Thr lie Tyr 
180 185 190 

Val Asn Pro Ser Gly Asp Asn Val lie Ala Pro Val Leu Thr Gly Asn 
195 200' 205 

Leu Lys Pro Asn Thr Asp Ser Asn Ala Leu lie Asp Gin Gin Asn Thr 
210 215 220 

Ser lie Lys Val Tyr Lys Val Asp Asn Ala Ala Asp Leu Ser Glu Ser 
225 230 235 240 

Tyr Phe Val Asn Pro Glu Asn Phe Glu Asp Val Thr Asn Ser Val Asn 
245 250 255 

lie Thr Phe Pro Asn Pro Asn Gin Tyr Lys Val Glu Phe Asn Thr Pro 
260 265 270 

Asp Asp Gin He Thr Thr Pro Tyr He Val Val Val Asn Gly His He 
Q 275 280 285 

Asp Pro Asn Ser Lys Gly Asp Leu Ala Leu Arg Ser Thr Leu Tyr Gly 
H 290 295 300 

ui 

I3J Tyr Asn Ser Asn He He Trp Arg Ser Met Ser Trp Asp Asn Glu Val 

HI 305 310 315 320 

Ala Phe Asn Asn Gly Ser Gly Ser Gly Asp Gly He Asp Lys . Pro Val 

i«\ 325 330 335 



a 



Val Pro Glu Gin Pro Asp Glu Gin Ala 
340 345 



(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 139 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

, (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Met Arg Gly Ser His His His His His His Gly Ser Glu Gly Gly Gin 
1 5 .10 15 

Asn Ser Gly Asn Gin Ser Phe Glu Glu Asp Thr Glu Glu Asp Lys Pro 
20 25 30 

Lys Tyr Glu Gin Gly Gly Asn He Val Asp He Asp Phe Asp Ser Val 
35 40 45 
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Pro Gin lie His 
50 

Thr Glu Lys Asp 
65 

He Asp Phe Asp 



Glu He He Glu 
100 

Gly Gly His Asn 
115 

Ser Gly Gin Asn 
130 



Gly Gin Asn Lys 
55 

Lys Pro Lys Tyr 
70 

Ser Val Pro His 
85 

Glu Asp Thr Asn 



Ser Val Asp Phe 
120 

Glu Phe Asp He 
135 



Gly Asn Gin Ser 
60 

Glu His Gly Gly 
75 

He His Gly Phe 
90 

Lys Asp Lys Pro 
105 

Glu Glu Asp Thr 



Lys Leu Asn 



Phe Glu Glu Asp 



Asn He He Asp 
80 

Asn Lys His Thr 

Ser Tyr Gin Phe 
110 

Leu Pro Lys Val 
125 
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